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SYSTEM AND METHOD FOR PRELOADING CACHE MEMORY IN 
RESPONSE TO AN OCCURRENCE OF A CONTEXT SWITCH 

5 BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention generally relates to computer processing techniques and, 
in particular, to a system and method for enabling more efficient execution of a 
10 computer program by preloading, in response to an occurrence of a context switch, 
cache memory with data that is likely to be used to execute the computer program 
after the context switch. 

RELATED ART 

15 histructions of a computer program are typically executed by a processing unit 

within a computer system. To execute the instructions, the processing unit utilizes 
data that may be stored in various memory locations, such as cache memory, main 
memory, disk memory, etc. Cache memory, as known in the art, resides close to and 
often on the same board as the processing unit. Therefore, data usually can be written 

20 to and retrieved from cache memory faster than from memory outside of the 
processing iinit, such as main memory, disk memory, etc. 

When the processing unit executes an instruction that writes to or reads from 
an address in memory, the processing unit first analyzes the cache memory to 
determine whether the data of the address is stored in cache memory. If this data is 

25 stored in cache memory, then the instruction can be quickly executed. In this regard, 
if the executing instruction is a read instruction, then the foregoing data is retrieved 

^ EXPRESS MAIL NO. 



j&om cache memory. If the executing instmction is a write instruction, then the 
foregoing data is overwritten. 

However, if the data of the address to be written to or read from is not located 
in the cache memory, then the processing unit issues a retrieval request to request 
5 retrieval of the foregoing data. In response to the retrieval request, a memory system 
manager searches for the requested data in memory outside of the processing unit. 
While the memory system manager is searching for the requested data, any detected 
instruction writing to or reading from the address of the requested data is usually 
stalled in the processing unit. Once the requested data has been located by the 
10 memory system manager, the requested data is retrieved and stored in a cache line of 
the cache memory. As knovm in the art, a "cache line" is a set of one or more data 
values stored in the cache memory. Once the requested data is stored in a cache line 
of the cache memory, the stalls on any of the instructions writing to or reading from 
the address of the requested data can i:^ually be removed. 
1 5 To enable the processing unit to determine whether data utilized by an 

instruction is stored in the cache memory, the cache memory usually includes 
mappings that are correlated with the data stored in the cache memory and that 
identify memory locations outside of the processing unit from which the data was 
originally retrieved or to which the data is ultimately stored. In this regard, when a 
20 data value is retrieved from memory outside of the processing unit and stored in the 
cache memory, a mapping is generated that identifies the memory location from which 
the data value is retrieved and that identifies the cache memory location where the 
data value is stored in cache memory. The mappmg can later be used by the 
processing unit to determine whether or not the data value of a particular address is 
25 stored in cache memory. 
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For example, when the processing unit is executing an instruction that utilizes 
data stored at a particular address identifying a memory location outside of the 
processing unit, the processing unit can determine whether the required data is stored 
in cache memory by analyzing the mappings. If one of the mappings identifies the 

5 particular address, then the required data value has aheady been stored in the cache 
memory, and the processing unit, therefore, does not need to issue a retrieval request 
to retrieve the required data value from memory outside of the processing unit, 
histead, the processing unit can retrieve the data value stored at the cache memory 
location identified by the foregoing mapping. 

10 It is well known that the execution of a computer program by a processing unit 

can be temporarily stopped to allow the processing unit to execute instructions from 
another computer program. In a multitasking computer system, multiple computer 
programs are allowed to run in an interleaved fashion such that the multiple computer 
programs appear to run simultaneously to the user. To achieve this, the operating 

1 5 system alternates which programs are executed by the processing unit. 

For example, assume that a computer system is running two computer 
programs in an interleaved fashion. The processing unit in the computer system 
initially executes instructions from one of the programs. At some point, the operating 
system induces a context switch, m which the processing unit stops executing 

20 instructions from the first computer program and begins to execute instructions from 
the second computer program. Later, the operating system induces another context 
switch, in which the processing unit stops executing instructions from the second 
computer program and begins to execute instructions from the first computer program. 
The execution of instructions from the two computer programs is alternated in this 

25 way until one or both of the programs terminates or until another computer program is 



invoked. It should be noted that any niimber of computer programs can share the 
processing unit according to the foregoing techniques. 

While the processing unit is executing a program, certain information is 
maintained in control registers within the processing unit. Such information is 

5 commonly referred to as "the machine state." When a computer program is context 
switched out (i.e., when execution of the computer program is halted in response to an 
occurrence of a context switch), the machine state that existed at the time of the 
context switch is stored to memory outside of the processing unit. When this same 
computer program is later context switched in (i.e. , when execution of the computer 

10 program resumes in response to an occurrence of another context switch), the 

foregoing machine state is retrieved and loaded into the control registers so that the 
execution of the computer program may resume without errors. 

However, when a computer program is context switched in during a context 
switch, the cache memory usually includes data retrieved in response to execution of 

15 instructions from the previously executed computer program (i. e. , the computer 

program that is context switched out during the same context switch). Therefore, it is 
not likely that the cache memory contains data useful for the execution of the 
computer program that is context switched in. As a result, when instructions from this 
computer program are initially executed after the context switch, it is likely that 

20 numerous retrieval requests will be issued, and it is, therefore, likely that nxraierous 
stalls will occur. The stalls significantly impact the performance of the processmg 
xmit. Once a significant amount of data is stored in the cache memory in response to 
execution of the computer program that is context switched in, the number of retrieval 
requests is likely to decrease, and the number of stalls are, therefore, likely to 

25 decrease, as well. 
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Thus, a heretofore unaddressed need exists in the industry for providing a 
system and method of reducing the adverse impact of stalls that initially occur after a 
context switch. 



5 SUMMARY OF THE INVENTION 

The present invention overcomes the inadequacies and deficiencies of the prior 
art as discussed hereinbefore. Generally, the present invention provides a system and 
m.ethod for enabling more efficient execution of a computer program by preloading, in 
response to an occurrence of a context switch, cache memory with data that is likely to 

10 be used to execute the computer program after the context switch. 

In architecture, the system of the present mvention utilizes processing 
circuitry, cache memory, computer memory, and memory control circuitry. The 
processing circuitry executes computer programs in an interleaved fashion. 
Accordingly, the processing circuitry stops executing a computer program during a 

15 first context switch in response to a first context switch command. Later, the 

processing circuitry resumes executing the computer program during a second context 
switch in response to a second context switch command. 

The memory control circuitry, in response to the second context switch 
command, identifies an address of the computer memory that is storing a data value 

20 previously used to execute an instruction of the computer program prior to the first 
context switch. The memory control circuitry then retrieves the data value fiom the 
computer memory and stores the retrieved data value in the cache memory. 
Accordingly, the retrieved data value is available to the processing circuitry for use in 
executing instructions of the computer program after the second context switch 

25 without the processing circuitry having to request retrieval of the foregoing data value. 
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In accordance with another feature of the present invention, the memory 
control circuitry selects the foregoing data value for retrieval based on a determination 
that the retrieved data value was used to execute an instruction of the computer 
program within a specified time period before the first context switch. 

5 The present invention can also be viewed as providing a method for efficiently 

executing instructions of computer programs. The method can be broadly 
conceptualized by the following steps: executing instructions from a computer 
program; halting the executing step duruig a first context switch in response to a fnst 
context switch command; resuming the executing step during a second context switch 

10 in response to a second context switch command; maintaining a plurality of mappings; 
correlating, via the mappings, data values stored in a cache memory with memory 
addresses of memory locations outside of the cache memory; storing the mappings in 
the computer memory in response to said first context switch command; retrieving, 
based on the mappings and in response to the second context switch command, at 

15 least one data value fi-om at least one of the addresses identified by the mappings; and 
storing the one retrieved data value in the cache memory. 

Other features and advantages of the present invention will become apparent to 
one skilled in the art upon examination of the following detailed description, when 
read in conjunction with the accompanying drawings. It is intended that all such 

20 features and advantages be included herein within the scope of the present mvention 
and protected by the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention can be better understood with reference to the following 
25 drawings. The elements of the drawings are not necessarily to scale relative to each 

6 



other, emphasis instead being placed upon clearly illustrating the principles of the 
invention. Furthermore, like reference numerals designate corresponding parts 
throughout the several views. 

FIG. 1 is a block diagram illustrating a conventional computer system. 
5 FIG. 2 is a block diagram illustrating a more detailed view of a processing unit 

depicted in FIG. 1. 

FIG. 3 is a block diagram illustrating a computer system in accordance with 
the present invention. 

FIG. 4 is a block diagram illustratmg a more detailed view of a processing unit 
10 depicted in FIG. 3. 

FIG. 5 is a flow chart illustrating the architecture and functionality of the 
computer system depicted m FIG. 3 in executing computer programs. 

FIG. 6 is a flow chart illustrating the architecture and functionality of the 
computer system depicted in FIG. 3 in performing a context switch. 

15 

DF.TATT.Kn DESCRIPTION OF THE PREFERRED EMBODIMENT 

In general, the present invention provides a system and method for preloading 
cache memory in response to an occurrence of a context switch. The data preloaded 
into the cache memory is preferably associated with the program being context 
20 switched in. Therefore, when execution of the program begins after the context 

switch, it is likely that at least some of the data needed to execute the instructions of 
the program is already stored in the cache memory, resulting in fewer stalls. 

FIG. 1 depicts a conventional computer system 20. The computer system 20 of 
FIG. 1 comprises one or more processing units 25 designed to execute instructions of 
25 one or more computer programs 28 stored in main memory 3 1 or some other suitable 
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memory location. The processing unit 25 communicates to the other elements within 
the system 20 via a local interface 34, which can include one or more buses. 

An input device 37, for example, a keyboard or a mouse, can be used to input 
data fi-om a user of the system 20, and screen display 39 or a printer 42 can be used to 
5 output data to the user. A disk storage mechanism 45 can be cormected to the local 

interface 34 to transfer data to and from a nonvolatile disk (e.g., magnetic, optical, etc.). 
The system 20 can be connected to a network interface 47 that allows the system 20 to 
exchange data with a network 49. 

The computer system 20 also includes a memory system manager 52, which is 
1 0 designed to control the retrieval of data from and the storage of data to the memory 
resources, such as the main memory 3 1 and disk storage mechanism 45, for example. 
As an example, when the processing unit 25 produces data to be stored, the memory 
system manager 52 is designed to determine where the data is to be stored and to store 
the data in the appropriate address. Furthermore, when the processing unit 25 issues a 
1 5 retrieval request (i. e. , a request to retrieve data), the memory system manager 52 is 

designed to locate and retrieve the requested data. The memory system manger 52 then 
transmits the requested data to the processing unit 25 for fiirther processing, hi some 
embodmients, the memory system manager 52 is located within the processing unit 25. 
An operating system 55 confrols the allocation and usage of the processing unit 
20 25 and other resources of the computer system 20. The operating system 55 acts as an 
interface to the processing unit 25 and controls which data is transmitted to the 
processing unit 25. For example, the operating system 55 controls when a program 28 is 
executed by controlling when instructions from the program 28 are transmitted to and 
interfaced with the processing unit 25. The functionality and operation of the operating 
25 system 55 is generally well known in the art. 

8 
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FIG. 2 depicts a more detailed view of the processing unit 25. As shown by FIG. 
2, the processing unit 25 includes at least one pipeline 62 or other type of processing 
circuitry that receives and processes instructions from computer programs 28 via 
techniques generally well known in the art. The pipeline 62 is coupled to control 

5 circuitry 65, which controls the operation of the pipeline 62. Control circuitry 65 may 
control various aspects and may perform various fijnctionality. For example, the control 
cncuitry 65 may analyze the instructions being processed by the pipeline 62 and 
determine when the instructions should be stalled. When one or more of the instructions 
should be stalled, the control cncuitry 65 may transmit a disable signal to the pipeline 

10 62, which then prevents further processing or execution of at least one instruction in 
response to the disable signal. When the stall is no longer necessary, the control 
circuitry 65 may remove the disable signal, and the processing of the stalled instruction 
or instructions may resume. It should be apparent to one skilled in the art that there are 
various other functionality that may be performed by the control circuitry 65. 

15 As shown by FIG. 2, the processing unit 25 includes a plurality of data registers 

67 that are used via conventional techniques to execute the instructions being processed 
by the pipeline 62. For example, to add two numbers together, the pipeline 62 may 
execute a fnst instruction that loads a first data value into a fnst data register 67, and the 
pipeline 62 may execute a second instruction that loads a second data value into a 

20 second data register 67. Then, the pipeline 62 may execute a third instruction that adds 
the contents of the first and second data registers 67 and stores the result in a third data 
register 67. 

To adequately control the operation of the processing unit 25, the control 
chcuitry 65 keeps track of various information, which is stored by the control circuitry 
25 65 in one or more control registers 68. Such information is primarily used to control the 
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operation of the processing unit 25 and is included within the processing unit's machine 
state. 

The processing unit 25 also includes cache memory 72 that provides high speed 
access to data. In this regard, retrieving data from main memory 3 1 or other memory 
5 locations outside of processing unit 25 for use in executing instructions in the processing 
unit 25 takes a relatively long time. Therefore, when data is utilized in the execution of 
an instruction by the pipeline 62, it is desirable to store the data in cache memory 72. 
Since the cache memory 72 is located close to the other elements of the processing unit 
25, the amomt of time required to store data to and retrieve data from cache memory 72 

1 0 in executing the instructions of the processing vmit 25 is relatively short. By storing 

frequently used data in the cache memory 72, the amount of time required to access data 
used in the execution of many instructions can be significantly reduced, and the 
performance of the processing unit 25 can be thereby improved. 

When the pipeline 62 executes an instruction (e.g., a write instruction) that stores 

1 5 a data value in main memory 3 1 or other memory location outside of processing unit 25, 
the pipeline 62 fransmits the data value and the address where the data value is to be 
stored to cache memory control 75. In response to the data value and the memory 
address transmitted to the cache memory confrol 75, the cache memory control 75 is 
designed to determine whether data from the memory location identified by the 

20 foregoing memory address is stored in cache memory 72. 

As described in the Background of the Invention section, the cache memory 
confrol 75 maintains in the cache memory 72 mappings indicating which memory 
addresses outside of processing unit 25 are respectively correlated with the data values 
stored in the cache memory 72. If one of the mappings identifies the address fransmitted 

25 to the cache memory control 75 by the pipeline 62, the cache memory control 75 

10 



detennines that data from the memory location identified by the foregoing address is 
stored in cache memory 72. In response to such a determination, the cache memory 
control 75 is designed to overwrite the foregoing data in cache memory 72 with the data 
value transmitted to the cache memory control 75 from the pipeline 62. At some later 

5 time, the memory location identified by the foregoing address transmitted to the cache 
memory control 75 is updated with the data value stored in the cache memory 75 and 
correlated with the foregoing address via the aforementioned mappings. 

If none of the mappings in the cache memory 72 identifies the address 
transmitted to the cache memory control 75 by the pipeline 62, then the cache memory 

1 0 control 75 detennines that the data from the memory location identified by the foregoing 
address is not stored in cache memory 72. In response, the cache memory control 75 
transmits a retrieval request to memory system manager 112, which retrieves the data 
stored at the memory location identified by the foregoing address. This retrieved data is 
then stored in a cache line of the cache memory 72 by cache memory control 117, and 

1 5 the cache memory control 1 1 7 generates a new mapping indicating a correlation 
between this data value just stored in the cache memory 72 and the memory address 
from which the data value was retrieved. Then, the cache memory control 117 
overwrites this data value with the data value transmitted to the cache memory control 
117 from the pipeline 62. At some later time, the memory location identified by the 

20 foregoing address fransmitted to the cache memory control 75 by the pipeline 62 is 
updated with the data value stored in the cache memory 75 and correlated with the 
foregoing address via the mappings in the cache memory 72. 

When the pipeline 62 executes an instruction (e.g. , a read instruction) that 
retrieves a data value, the pipeline 62 transmits to the cache memory control 75 the 

25 address identifying the location outside of processing unit 25 where the requested data 

11 



value is stored. In response, the cache memory control 75 determines whether the 
requested data value is stored in cache memory 72. In this regard, the cache memory 
control 75 analyzes the mappings stored in cache memory 75. If one of the mappings 
identifies the address transmitted to the cache memory control 75 by the pipeline 62, 

5 then the cache memory control 75 determines that the requested data value is stored in 
the cache memory 72 and retrieves the requested data value by retrieving the data value 
at the location in cache memory 72 identified by the one mapping. 

If none of the mappings in the cache memory 72 identifies the address 
transmitted to the cache memory control 75 by the pipeline 62, the cache memory 

10 control 75 determines that the requested data value is not stored in the cache memory 
72. As a result, the data value should be retrieved fi:om memory outside of processing 
unit 25, such as main memory 3 1 . Therefore, cache memory control 75 issues a retrieval 
request, which includes the memory address of the location where the requested data 
value is stored. In response, memory system manager 52 retrieves the requested data 

1 5 value from a memory location outside of processing unit 25 based on the foregoing 
address and returns this data value to the processing unit 25. While the data value is 
being retrieved, the control circuitry 65 stalls any instructions in the pipeline 62 writing 
to or reading fi-om the foregoing address. 

As previously set forth in the Background of the Invention section, context 

20 switching can be used to enable multitasking, hi this regard, the operating system 55, at 
various times, may determine that the computer program 28 being executed by the 
processing unit 25 should be switched. In response to such a determination, the 
operating system 55 transmits a context switch command to the processing unit 25 and 
to the memory system manager 52 indicating that a context switch should occur. In 

25 particular, this context switch command indicates that a first program 28 being executed 

12 



by the processing unit 25 should be context switched out and that a second program 28 
{i.e., the next program 28 to be executed) should be context switched in. As used 
herein, a "context switch command" is any set of data that indicates a context switch 
should occur. A context switch command may include one or more instructions that, 

5 when executed by the processing unit 25, cause the processing unit 25 to perform a 

context switch. However, it is not necessary for the context switch command to include 
any instructions executable by the processing unit 25 provided that the processing unit 
25 is configured to perform context switchmg without such instructions being 
transmitted from the operating system 118. 

10 In response to the context switch command, the memory system manager 52 

stops transmitting instructions from the first program 28 being executed by the 
processing unit 25 and starts transmitting instructions from the second program 28. 
Furthermore, the control circuitry 65 transmits the contents of control registers 68 to 
memory system manager 52, which stores the contents in memory outside of processing 

15 system 25. 

At some later time, the operating system 55 determines that the first computer 
program 28, which was previously context switched out as described above, should be 
context switched in via another context switch. Therefore, the operating system 55 
transmits a context switch command to the processing unit 25 and to the memory system 

20 manager 52 indicating that a context switch should occur. In response, the program 28 
executing immediately prior to this context switch is context switched out, according to 
the techniques described above, and the first computer program is context switched in. 
In this regard, the aforementioned contents from control registers 68, which were stored 
in memory outside of processing unit 25 in response to the previously described context 

25 switch {i.e. , when the first program 28 was context switched out), are now retrieved 



from memory outside of the processing unit 25 by memory system manger 52 and are 
loaded into control registers 68. Then, instructions from the first computer program 28 
are transmitted to the processing unit 25 for execution. 

At this point, execution of the first program 28 resumes at a point where 

5 execution was interrupted when the first program 28 was previously context switched 
out. However, since other programs 28 have been executed by the processing unit 25 
since the first program 28 was previously context switched out, it is likely that the data 
presently stored in the cache memory 72 is associated with programs 28 other than the 
first program 28 being context switched in. Therefore, as the first program 28 executes, 

10 it is likely that numerous stalls will initially occur as the data utilized in the execution of 
the first program 28 is retrieved from memory stored outside of processing unit 25 and 
stored in the cache memory 72 in response to execution of instructions from the first 
program 28. 

FIG. 3 depicts a computer system 100 that employs a processing unit 110 
15 designed in accordance with the principles of the present invention. Like conventional 
computer system 20, system 100 preferably includes main memory 31, programs 28, a 
local interface 34, an input device 37, a display device 39, a disk storage mechanism 45, 
a printer 42, and a network interface 47 that may interface the system 100 with a 
network 49. The preferred embodiment of the system 100 also includes a memory 
20 system manager 112 and an operating system 118. Except as otherwise described 

hereinafter, the processing unit 110, the memory system manager 112 and the operating 
system 11 8 are configured identical to and operate the same as the processing unit 25, 
the memory system manager 52, and the operating system 55, respectively. 

The memory system manager 112 and the operating system 118 can each be 
25 implemented in software, hardware, or a combination thereof. In the preferred 



embodiment, as illustrated by way of example in FIG. 3, the memory system manager 
1 12 is implemented via hardware that is interfaced with the local interface 34, and the 
operating system 1 1 8 is implemented in software and stored in main memory 3 1 . 

As shown by FIG. 4, the processing xmit 110, similar to conventional processing 

5 unit 25, includes one or more pipelines 62 or other processing circuitry, data registers 
67, control registers 68, control circuitry 65, and cache memory 72. The processing unit 
110 also includes cache memory control 117. Except as otherwise described hereinafter, 
the cache memory control 1 17 is configured identical to and operates the same as 
conventional cache memory control 75. 

1 0 While a program 28 is executing, the cache memory control 117 tracks how 

often each cache line is utilized in executing the instructions of the program 28. 
Information indicating which cache lines in the cache memory 72 have been recently 
utilized is preferably stored as control data in the control registers 68, although this 
information may be stored in another location, if desired. This information is stored in 

1 5 memory outside the processing unit 110 when the foregoing program 28 is context 

switched out during a first context switch and is later loaded into the control registers 68 
when the foregoing program 28 is context switched in during a second context switch. 
Based on this information, the cache memory control 117 determines which data values 
were recently used in executing the program 28 prior to the first context switch {i.e. , 

20 prior to being context switched out) and issues data requests for these data values as 
execution of the program 28 is commenced after the second context switch. 

Therefore, the process for loading these data values into the cache memory 72 is 
initiated before these data values are requested via execution of instructions by the 
pipeline 62. In other words, the data values are preloaded into the cache memory 72, 

25 since they are loaded in response to the occurrence of the second context switch instead 

15 



of being loaded in response to an execution of an instruction by the pipeline 62. As a 
result, when an instruction that requests or utihzes one of the foregoing data values is 
processed by the pipeline 72, it is likely that the data value will akeady be stored in the 
cache memory 72, thereby preventing the need to retrieve the data value from memory 

5 outside of processing vmit 110. 

To better illustrate the principles of the present invention, assume that a first 
computer program 28 is being executed by the processing unit 110. As in conventional 
systems 20, data is stored in cache memory 72 as the first program 28 executes. One or 
more data values may be stored in each cache line of the cache memory 72. 

10 M the preferred embodiment of the present invention, each cache line is 

preferably correlated with a bit of information, referred to hereafter as a "utilization 
flag," which is stored in the control registers 68. The utilization flags are initially 
deasserted. While the first program 28 is executing, data is retrieved out of and stored 
into cache memory 72, according to the techniques previously described for 

1 5 conventional system 20, as represented by blocks 131-138 of FIG. 5 . However, 
whenever a data value is retrieved from or stored to cache memory 72, the cache 
memory control 117 is designed to assert the utilization flag correlated with the cache 
line containing the retrieved or stored data value, as indicated by blocks 142 and 144. 
The operating system 1 18 is preferably designed to periodically (e.g. , every ten 

20 milliseconds) fransmit a utilization flag clear command to the cache memory control 
117. In response to this command, the cache memory control 1 1 7 is designed to 
deassert each of the utilization flags, as shown by blocks 151 and 153. Therefore, the 
only utilization flags that should be asserted are the flags that are correlated with cache 
lines containing at least one data value that has been retrieved from or stored to cache 

25 memory 72 since the last received utilization flag clear command. Accordingly, only the 

16 



utilization flags correlated with cache lines containing at least one recently utilized (i.e. , 
retrieved or stored) data value should be asserted. 

At some point, the first program is likely to be context switched out during a 
context switch referred to hereafter as "the first context switch." When the first program 
5 28 is context switched out, the cache memory control 1 1 7 is designed to transmit a 
request to memory system manger 112 to store certain control information to memory 
outside of processing unit 1 10, as shown by blocks 156 and 158. Included in this 
control information are the utilization flags presently stored in the control registers 68 
and the mappings presently stored in the cache memory 72. These mappings indicate 

1 0 the addresses outside of processing unit 110 that are correlated with the data values 

presently in cache memory 72. During each context switch, each data value stored in the 
cache memory 72 is preferably stored in the memory address correlated with the data 
value by the mappings stored in the cache memory 72, as shown by block 159. 

After the first context switch is complete and the first program 28 is context 

15 switched out, one or more programs 28 may be sequentially executed by the processing 
unit 1 10. However, at some point, the first program 28 should be context switched in by 
the operating system 1 1 8 during a context switch referred to hereafter as "the second 
context switch." When this occurs, the aforementioned control data previously stored 
outside of processing unit 110 during the first context switch is provided to processing 

20 unit 1 1 0, as shown by block 1 72 of FIG. 6. In this regard, the memory system manager 
1 12, in response to a context switch command from the operating system 1 1 8, is 
designed to retrieve the control data and to transmit the control data to the processing 
unit 110 during the second context switch. The cache memory control 117 then stores 
the utilization flags in the control registers 68 and the mappings in the cache memory 72. 
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Then, the cache memory control 117 analyzes the utilization flags in order to 
select for preloading the data values that were recently retrieved from or stored to the 
cache memory 72 in response to an executing instruction just prior to the first context 
switch, as shown by block 175. To this end, each cache line correlated with an asserted 

5 utilization flag should have included at least one such data value before the first context 
switch. Based on the mappings now stored in the cache memory 72 (i.e. , the mappings 
stored in the cache memory 72 during the second context switch), the cache memory 
control 117 then issues one or more retrieval requests for the data value or data values of 
each cache line correlated with an asserted utilization flag. In response, the memory 

10 system manager 112 retrieves the requested data values, and these data values are then 
stored in the cache memory 72, as indicated by block 177. These data values should be 
the data values that were stored in the cache lines correlated with asserted utilization 
flags just prior to the first context switch. In other words, these data values should be 
data values that were recently utilized in executing the first program 28 just prior to the 

15 first context switch. 

The retrieval of the foregoing data values can be commenced prior to or during 
the execution of the first program 28 by the processing unit 110 that occurs subsequent 
to the second context switch. Therefore, before or during execution of the first program 
28 that occurs subsequent to the second context switch (i.e. , after the first program has 

20 been context switched in), the data values recently used in the execution of the first 
program 28 prior to the first context switch (i.e., prior to the first program 28 being 
context switched out) are preloaded into the cache memory 72 as shown by blocks 177 
and 179. In other words, the data values recently used in the execution of the first 
program 28 prior to the first context switch are loaded into the cache memory 72 in 

25 response to the occurrence of the second context switch. Therefore, it is likely that 



many of the data values initially manipxilated (i.e., retrieved or overwritten) by execution 
of the first computer program 28 after the second context switch will already be stored 
in the cache memory 72 before these data values are requested via the execution of 
instructions utilizing the data values. As a result, the number of stalls that initially occur 
5 after first program 28 is context switched in are likely to be reduced. 

It should be noted that different data values may be selected for preloading into 
the cache memory 72 in other embodiments. For example, when a program 28 is 
context switched in, it is possible to preload into the cache memory 72 all of the data 
values that were previously stored in the cache memory 72 when the program 28 was 

10 previously context switched out. However, such an embodiment is not optimal, since 
many data values that were not recently used prior to the associated context switch out 
would be preloaded. 

It should be emphasized that the above-described embodiments of the present 
invention, particularly, any "preferred" embodiments, are merely possible examples of 

15 implementations, merely set forth for a clear understanding of the principles of the 
invention. Many variations and modifications may be made to the above-described 
embodiment(s) of the invention without departing substantially ftom the spirit and 
principles of the invention. All such modifications and variations are intended to be 
included herein within the scope of this disclosure and the present invention and 

20 protected by the following claims. 
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Now, therefore, the following is claimed: 

1 1 . A computer system for efficiently executing instructions of computer 

2 programs, comprising: 

3 processing circuitry configured to execute instructions from one of a plurality 

4 of programs, said processing circuitry further configured to stop executing said one 

5 program during a first context switch in response to a first context switch command 

6 and to resume executing said one program during a second context switch in response 

7 to a second context switch command; 

8 cache memory; 

9 computer memory having a plurality of addresses; and 

10 memory control circuitry coupled to said processing circuitry, said memory 

1 1 control circuitry, in response to said second context switch command, configured to 

12 identify one of said addresses of said computer memory that is storing a data value 

13 previously used to execute an instruction of said one computer program prior to said 

14 first context switch, said memory control circuitry further configured to retrieve said 

15 data value fi-om said computer memory in response to said second context switch 

16 command and to store said retrieved data value ui said cache memory. 

1 2. The system of claim 1, wherein said processing circuitry is further 

2 configured to execute instructions of another of said computer programs in response 

3 to said first context switch command. 
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1 3 . The system of claim 1 , wherein said memory control circuitry is further 

2 configured to determine, in response to said second context switch command, whether 

3 said data value was utilized by said processing circuitry to execute an instruction 

4 within a specified time period prior to said first context switch. 

1 4. The system of claim 1, wherein said memory control circuitry is 

2 configured to maintain a plurality of mappings, each of said mappings respectively 

3 correlating a data value stored in said cache memory with one of said memory 

4 addresses of said computer memory, said memory control circuitry further configured 

5 to maintain a bit of information that is associated with one of said mappings, said 

6 memory control circuitry configured to assert said bit when a data value correlated 

7 with a computer memory address via said one mapping is utilized to execute an 

8 instruction of said one program, said memory control circuitry fiirther configured to 

9 deassert said bit periodically. 

1 5 . The system of claim 4, wherein said memory control circuitry is fiirther 

2 configured to determine, in response to said second context switch command and 

3 based on said bit, whether said data value was recently utilized by said processing 

4 circuitry to execute an instruction prior to said first context switch. 

1 6. The system ofclaim 5, wherein said memory control circuitry is further 

2 configured to store said mappings and said bit to said computer memory in response 

3 to said first context switch command and to retrieve said mappings and said bit from 

4 said computer memory in response to said second context switch command. 
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1 7. A computer system for efficiently executing instructions of computer 

2 programs, comprising: 

3 processing circuitry configured to execute instructions from one of a plurality 

4 of programs, said processing circuitry further configured to stop executing said one 

5 program during a first context switch in response to a first context switch command 

6 and to resume executing said one program during a second context switch in response 



7 to a second context switch command; 

8 cache memory; 

9 computer memory having a plurality of addresses; and 

10 memory control circuitry coupled to said processing circuitry, said memory 



1 1 control circuitry configured to maintain a plurality of mappings, said mappings 

12 respectively correlating data values stored in said cache memory with said memory 

13 addresses of said computer memory, said memory control circuitry configured to store 

14 said mappings in said computer memory in response to said first context switch 



15 command and to retrieve data values from said addresses that are identified by said 

16 mappings stored in said computer memory in response to said second context switch 

17 command, said memory confrol circuitry fiirther configured to store in said cache 

18 memory said retrieved data values. 

1 8. The system of claim 7, wherein said processing circuitry is further 

2 configured to execute instructions of another of said computer programs in response 

3 to said first context switch command. 
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1 9. The system of claim 7, wherein said memory control circuitry is further 

2 configured to maintain utilization data indicative of which of said memory addresses 

3 are storing data values accessed within a specified time period prior to said first 

4 context switch, and wherein said memory control circuitry, based on said mappings 

5 and said utilization data, is fiirther configured to select for retrieval data values 

6 identified by one of said mappings and accessed within said specified time period, 

7 wherein each of said retrieved data values is a data value selected by said memory 

8 control circuitry based on said utilization data. 

1 1 0. The system of claim 9, wherein said memory control circuitry is further 

2 configured to store said utilization data in said computer memory in response to said 

3 first context switch command and to retrieve said utihzation data and said mappings 

4 in response to said second context switch command. 

1 11. The system of claim 9, wherein said utilization data is a plurality of 

2 bits respectively associated with said mappings, wherein said memory control 

3 circuitry, for each data value accessed by said memory control circuitry, is configured 

4 to assert the bit associated with the mapping that correlates said each data value with 

5 one of said computer memory addresses, and wherein said memory control circuitry is 

6 configured to periodically deassert each of said plurality of bits. 
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1 1 2. A method for efficiently executing instructions of computer programs, 

2 comprising the steps of: 

3 executing a plurahty of computer programs in an interleaved fashion; 

4 switching which of said computer programs is being executed in said 

5 executing step; 

6 storing, prior to said switching step, at an address in computer memory a data 

7 value utilized in said executing step; 

8 identifying said address in response to said switching step; 

9 retrieving said data value from said address based on said identifying step and 

10 in response to said switching step; 

1 1 storing said retrieved data value in cache memory; and 

12 retrieving said data value from said cache memory in response to said 

13 executing step. 

1 13. The method of claim 12, wherein said executing step further includes 

2 the step of executing instructions of a computer program in response to said switching 

3 step, and wherein said method fmther comprises the steps of: 

4 determining that said address is storing a data value previously utilized in said 

5 executing step to execute an instruction of said computer program; and 

6 performing said identifying step based on said determining step. 
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1 14. The method of claim 12, further comprising the steps of: 

2 correlating, respectively, data values stored in said cache memory with 

3 addresses of said computer memory; 

4 asserting a bit each time a data value correlated with said address identified in 

5 said identifying step is accessed in response to said executing step; and 

6 periodically deasserting said bit. 

1 15. The method of claim 14, wherein said executing step further includes 

2 the step of executing instructions of a computer program in response to said switching 

3 step, and wherein said method further comprises the steps of: 

4 determining, based on said bit, that said address identified in said identifying 

5 step is storing a data value previously utilized in said executing step to execute an 

6 instruction of said computer program; and 

7 performing said identifying step based on said determining step. 



25 



HP Docket No. 10001219 



1 1 6 . A method for efficiently executing instructions of computer programs, 

2 comprising the steps of: 

3 executing instructions from a computer program; 

4 halting said executing step during a first context switch in response to a first 

5 context switch command; 

6 resuming said executing step during a second context switch in response to a 

7 second context switch command; 

8 maintaining a plurality of mappings; 

9 correlating, via said mappings, data values stored in a cache memory with 

10 memory addresses of computer memory outside of said cache memory; 

1 1 storing said mappings in said computer memory in response to said first 

12 context switch command; 

13 retrieving, based on said mappings and in response to said second context 

14 switch command, at least one data value firom at least one of said addresses identified 

15 by said mappings; and 

16 storing said at least one retrieved data value in said cache memory. 

1 17. The method of claim 16, further comprising the steps of: 

2 maintaining utilization data indicative of which of said memory addresses are 

3 storing data values accessed within a specified time period prior to said first context 

4 switch; and 

5 selecting, based on said mappings and said utilization data, data values 

6 accessed within said specified time period, 

7 wherein said retrieving step includes the step of retrieving each data value 

8 selected in said selecting step. 
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1 18. The method of claim 17, further comprising the steps of: 

2 storing said utilization data in said computer memory in response to said first 

3 context switch command; and 

4 retrieving said utilization data and said mappings in response to said second 

5 context switch command. 



1 19. The method of claim 17, wherein said utilization data is a plurality of 

2 bits respectively associated with said mappings, and wherein said method further 

3 comprises the steps of: 

4 asserting each of said bits associated respectively with each of said mappings 

5 that identifies a data value accessed in response to said executing step; and 

6 periodically deasserting each of said bits. 
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ABSTRACT OF THE DISCLOSURE 

In a multitasking computer system, data is preloaded into cache memory upon 
the occurrence of a context switch. To this end, processing circuitry stops executing a 
computer program during a first context switch in response to a first context switch 
5 command. Later, the processing circuitry resumes executing the computer program 
during a second context switch in response to a second context switch command. The 
memory control circuitry, in response to the second context switch command, 
identifies an address of computer memory that is storing a data value previously used 
to execute an instruction of the computer program prior to the first context switch. 
1 0 The memory control circuitry then retrieves the data value from the computer memory 
and stores the retrieved data value in the cache memory. Accordingly, the retrieved 
data value is available to the processing circuitry for use in executing instructions of 
the computer program after the second context switch without the processing circuitry 
having to request retrieval of the foregoing data value. 
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